Evalita-istc Comparison of Open Source Tools on Clean and Noisy Digits Recognition Tasks
نویسندگان
چکیده
1. ABSTRACT EVALITA is a recent initiative devoted to the evaluation of Natural Language and Speech Processing tools for Italian. The general objective of EVALITA is to promote the development of language and speech technologies for the Italian language, providing a shared framework where different systems and approaches can be evaluated in a consistent manner. In this work the results of the evaluation of three open source ASR toolkits (CSLU Speech Tools, CSLR SONIC, SPHINX) working on the EVALITA clean and noisy digits recognition task will be described together with the complete evaluation methodology.
منابع مشابه
Connected Digits Recognition Task: ISTC–CNR Comparison of Open Source Tools
EVALITA is a recent initiative devoted to the evaluation of Natural Language and Speech Processing tools for Italian. In this work, the results of three open source ASR toolkits will be described. CSLU Speech Tools, CSLR SONIC, CMU SPHINX are applied on the EVALITA clean and noisy digits recognition task and this report will describe the complete evaluation methodology. CSLR SONIC has resulted ...
متن کاملComparison of Standard and Hybrid Modeling Techniques for Distributed Speech Recognition
Distributed speech recognition (DSR) is an interesting technology for mobile recognition tasks where the recognizer is split up into two parts and connected with a transmission channel. We compare the performance of standard and hybrid modeling approaches in this environment. The evaluation is done on clean and noisy speech samples taken from the TI digits and the AURORA database. Our results s...
متن کاملNoise Robust Music Artist Recognition Using I-Vector Features
In music information retrieval (MIR), dealing with different types of noise is important and the MIR models are frequently used in noisy environments such as live performances. Recently, i-vector features have shown great promise for some major tasks in MIR, such as music similarity and artist recognition. In this paper, we introduce a novel noise-robust music artist recognition system using i-...
متن کاملSpeech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
متن کاملEVALITA 2009: Description and Results of the Speech Recognition task
In this paper, we describe motivations and features of the Speech Recognition task at EVALITA 2009. Systems are compared about the performance on the recognition of uttered connected digits sequences. Interesting results will be shown for various types of approaches, ranging from classic algorithms, to non-standard models. Commercial as well as prototype speech recognizers have been involved in...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010